In mathematics, a quadratic equation is a polynomial equation of the second degree. The general form is
where x represents a variable, and a, b, and c, constants, with a ≠ 0. (If a = 0, the equation becomes a linear equation.)
The constants a, b, and c, are called respectively, the quadratic coefficient, the linear coefficient and the constant term or free term. The term "quadratic" comes from quadratus, which is the Latin word for "square". Quadratic equations can be solved by factoring, completing the square, graphing, Newton's method, and using the quadratic formula (given below).
One common use of quadratic equations is computing trajectories in projectile motion. Another common use is in electronic amplifier design for control of step response and stability.
Contents |
A quadratic equation with real or complex coefficients has two solutions, called roots. These two solutions may or may not be distinct, and they may or may not be real.
The roots are given by the quadratic formula
where the symbol "±" indicates that both
and |
are solutions of the quadratic equation.
There is also a shortened version of the quadratic formula which is commonly used when the coefficient of x is an even number:
In this case the solutions are given by:
In the above formula, the expression underneath the square root sign is called the discriminant of the quadratic equation, and is often represented using an upper case Greek delta:
A quadratic equation with real coefficients can have either one or two distinct real roots, or two distinct complex roots. In this case the discriminant determines the number and nature of the roots. There are three cases:
and |
Thus the roots are distinct if and only if the discriminant is non-zero, and the roots are real if and only if the discriminant is non-negative.
The solutions of the quadratic equation
are also the roots of the quadratic function:
since they are the values of x for which
If a, b, and c are real numbers and the domain of f is the set of real numbers, then the roots of f are exactly the x-coordinates of the points where the graph touches the x-axis.
It follows from the above that, if the discriminant is positive, the graph touches the x-axis at two points, if zero, the graph touches at one point, and if negative, the graph does not touch the x-axis.
The term
is a factor of the polynomial
if and only if r is a root of the quadratic equation
It follows from the quadratic formula that
In the special case () where the quadratic has only one distinct root (i.e. the discriminant is zero), the quadratic polynomial can be factored as
Certain higher-degree equations can be brought into quadratic form and solved that way. For example, the 6th-degree equation in x:
can be rewritten as:
or, equivalently, as a quadratic equation in a new variable u:
where
Solving the quadratic equation for u results in the two solutions:
Thus
Concentrating on finding the three cube roots of 2 + 2i – the other three solutions for x will be their complex conjugates – rewriting the right-hand side using Euler's formula:
(since e2kπi = 1), gives the three solutions:
Using Eulers' formula again together with trigonometric identities such as cos(π/12) = (√2 + √6) / 4, and adding the complex conjugates, gives the complete collection of solutions as:
and
The Babylonian mathematicians, as early as 2000 BC (displayed on Old Babylonian clay tablets) could solve a pair of simultaneous equations of the form:
which are equivalent to the equation:[1]
The original pair of equations were solved as follows:
There is evidence pushing this back as far as the Ur III dynasty.[3]
In the Sulba Sutras in ancient India circa 8th century BC quadratic equations of the form ax2 = c and ax2 + bx = c were explored using geometric methods. Babylonian mathematicians from circa 400 BC and Chinese mathematicians from circa 200 BC used the method of completing the square to solve quadratic equations with positive roots, but did not have a general formula.
Euclid, the Greek mathematician, produced a more abstract geometrical method around 300 BC. Pythagoras and Euclid used a strictly geometric approach, and found a general procedure to solve the quadratic equation.[4] In his work Arithmetica, the Greek mathematician Diophantus solved the quadratic equation, but giving only one root, even when both roots were positive.[5]
In 628 AD, Brahmagupta, an Indian mathematician, gave the first explicit (although still not completely general) solution of the quadratic equation
as follows:
“ | To the absolute number multiplied by four times the [coefficient of the] square, add the square of the [coefficient of the] middle term; the square root of the same, less the [coefficient of the] middle term, being divided by twice the [coefficient of the] square is the value. (Brahmasphutasiddhanta (Colebrook translation, 1817, page 346)[2] | ” |
This is equivalent to:
The Bakhshali Manuscript written in India in the 7th century AD contained an algebraic formula for solving quadratic equations, as well as quadratic indeterminate equations (originally of type ax/c = y).
Muhammad ibn Musa al-Khwarizmi (Persia, 9th century), inspired by Brahmagupta, developed a set of formulas that worked for positive solutions.[4] Al-Khwarizmi goes further in providing a full solution to the general quadratic equation, accepting one or two numerical answers for every quadratic equation, while providing geometric proofs in the process.[6] He also described the method of completing the square and recognized that the discriminant must be positive,[7] which was proven by his contemporary 'Abd al-Hamīd ibn Turk (Central Asia, 9th century) who gave geometric figures to prove that if the discriminant is negative, a quadratic equation has no solution.[8] While al-Khwarizmi himself did not accept negative solutions, later Islamic mathematicians that succeeded him accepted negative solutions,[9] as well as irrational numbers as solutions.[10] Abū Kāmil Shujā ibn Aslam (Egypt, 10th century) in particular was the first to accept irrational numbers (often in the form of a square root, cube root or fourth root) as solutions to quadratic equations or as coefficients in an equation.[11]
The Jewish mathematician Abraham bar Hiyya Ha-Nasi (12th century, Spain) authored the first European book to include the full solution to the general quadratic equation.[12] His solution was largely based on Al-Khwarizmi's work.[13] The writing of the Chinese mathematician Yang Hui (1238-1298 AD) represents the first in which quadratic equations with negative coefficients of 'x' appear, although he attributes this to the earlier Liu Yi. The first appearance of the general solution in the modern mathematical literature is evidently in an 1896 paper by Henry Heaton.[14]
By 1545 Gerolamo Cardano compiled the works related to the quadratic equations. In 1637, when René Descartes published La Géométrie, modern mathematics was born, and the quadratic formula has adopted the form we know today.[4]
The quadratic formula can be derived by the method of completing the square,[15] so as to make use of the algebraic identity:
Dividing the quadratic equation
by a (which is allowed because a is non-zero), gives:
or
The quadratic equation is now in a form to which the method of completing the square can be applied. To "complete the square" is to add a constant to both sides of the equation such that the left hand side becomes a complete square:
which produces
The right side can be written as a single fraction, with common denominator 4a2. This gives
Taking the square root of both sides yields
Isolating x, gives
The quadratic formula can be derived by starting with equation
which describes the parabola as ax2 with the vertex shifted from the origin to (xV, yV).
Solving this equation for x is straightforward and results in
Using Viète's formulas for the vertex coordinates
the values of x can be written as
Note. The formulas for xV and yV can be derived by comparing the coefficients in
and
Rewriting the latter equation as
and comparing with the former results in
from which Viète's expressions for xV and yV can be derived.
An alternative way of deriving the quadratic formula is via the method of Lagrange resolvents, which is an early part of Galois theory.[16] A benefit of this method is that it generalizes to give the solution of cubic polynomials and quartic polynomials, and leads to Galois theory, which allows one to understand the solution of polynomials of any degree in terms of the symmetry group of their roots, the Galois group.
This approach focuses on the roots more than on rearranging the original equation. Given a monic quadratic polynomial
assume that it factors as
Expanding yields
where
and
Since the order of multiplication does not matter, one can switch α and β and the values of p and q will not change: one says that p and q are symmetric polynomials in α and β. In fact, they are the elementary symmetric polynomials – any symmetric polynomial in α and β can be expressed in terms of and αβ. The Galois theory approach to analyzing and solving polynomials is: given the coefficients of a polynomial, which are symmetric functions in the roots, can one "break the symmetry" and recover the roots? Thus solving a polynomial of degree n is related to the ways of rearranging ("permuting") n terms, which is called the symmetric group on n letters, and denoted For the quadratic polynomial, the only way to rearrange two terms is to swap them ("transpose" them), and thus solving a quadratic polynomial is simple.
To find the roots α and β, consider their sum and difference:
These are called the Lagrange resolvents of the polynomial; notice that these depend on the order of the roots, which is the key point. One can recover the roots from the resolvents by inverting the above equations:
Thus, solving for the resolvents gives the original roots.
Formally, the resolvents are called the discrete Fourier transform (DFT) of order 2, and the transform can be expressed by the matrix with inverse matrix The transform matrix is also called the DFT matrix or Vandermonde matrix.
Now is a symmetric function in α and β, so it can be expressed in terms of p and q, and in fact as noted above. Contrariwise, is not symmetric, since switching α and β yields (formally, this is termed a group action of the symmetric group of the roots). Since is not symmetric, it cannot be expressed in terms of the roots p and q, as these are symmetric in the roots and thus so is any polynomial expression involving them. However, changing the order of the roots only changes by a factor of and thus the square is symmetric in the roots, and thus expressible in terms of p and q. Using the equation
yields
and thus
If one takes the positive root, breaking symmetry, one obtains:
and thus
Thus the roots are
which is the quadratic formula. Substituting yields the usual form for when a quadratic is not monic. The resolvents can be recognized as being the vertex, and is the discriminant (of a monic polynomial).
A similar but more complicated method works for cubic equations, where one has three resolvents and a quadratic equation (the "resolving polynomial") relating and which one can solve by the quadratic equation, and similarly for a quartic (degree 4) equation, whose resolving polynomial is a cubic, which can in turn be solved. However, the same method for a quintic equation yields a polynomial of degree 24, which does not simplify the problem, and in fact solutions to quintic equations in general cannot be expressed using only roots.
In some situations it is preferable to express the roots in an alternate form.
This alternative requires c to be nonzero; for, if c is zero, the formula correctly gives zero as one root, but fails to give any second, non-zero root. Instead, one of the two choices for ∓ produces the indeterminate form 0/0, which is undefined. However, the alternative form works when a is zero (giving the unique solution as one root and division by zero again for the other), which the normal form does not (instead producing division by zero both times).
The roots are the same regardless of which expression we use; the alternate form is merely an algebraic variation of the common form:
The alternative formula can reduce loss of precision in the numerical evaluation of the roots, which may be a problem if one of the roots is much smaller than the other in absolute magnitude. The problem of c possibly being zero can be avoided by using a mixed approach:
Here sgn denotes the sign function.
Dividing the quadratic equation by a gives the simplified monic form of
where p = b⁄a and q = c⁄a. This in turn simplifies the root and discriminant equations somewhat to
(Cf. completing the square and the Lagrange resolvents derivation methods above.)
A careful floating point computer implementation differs a little from both forms to produce a robust result. Assuming the discriminant, b2 − 4ac, is positive and b is nonzero, the code will be something like the following:[17]
Here sgn(b) is the sign function, where sgn(b) is 1 if b is positive and −1 if b is negative; its use ensures that the quantities added are of the same sign, avoiding catastrophic cancellation. The computation of x2 uses the fact that the product of the roots is c/a.
Viète's formulas give a simple relation between the roots of a polynomial and its coefficients. In the case of the quadratic polynomial, they take the following form:
and
These results follow immediately from the relation:
which can be compared term by term with:
The first formula above yields a convenient expression when graphing a quadratic function. Since the graph is symmetric with respect to a vertical line through the vertex, when there are two real roots the vertex’s x-coordinate is located at the average of the roots (or intercepts). Thus the x-coordinate of the vertex is given by the expression:
The y-coordinate can be obtained by substituting the above result into the given quadratic equation, giving
As a practical matter, Viète's formulas provide a useful method for finding the roots of a quadratic in the case where one root is much smaller than the other. If |x 1| << |x 2|, then x 1 + x 2 ≈ x 1, and we have the estimate:
The second Viète's formula then provides:
These formulas are much easier to evaluate than the quadratic formula under the condition of one large and one small root, because the quadratic formula evaluates the small root as the difference of two very nearly equal numbers (the case of large b), which causes round-off error in a numerical evaluation. The figure shows the difference between (i) a direct evaluation using the quadratic formula (accurate when the roots are near each other in value) and (ii) an evaluation based upon the above approximation of Viète's formulas (accurate when the roots are widely spaced). As the linear coefficient b increases, initially the quadratic formula is accurate, and the approximate formula improves in accuracy, leading to a smaller deference between the methods as b increases. However, at some point the quadratic formula begins to lose accuracy because of round off error, while the approximate method continues to improve. Consequently the difference between the methods begins to increase as the quadratic formula becomes worse and worse.
This situation arises commonly in amplifier design, where widely separated roots are desired to insure a stable operation (see step response).
The formula and its derivation remain correct if the coefficients a, b and c are complex numbers, or more generally members of any field whose characteristic is not 2. (In a field of characteristic 2, the element 2a is zero and it is impossible to divide by it.)
The symbol
in the formula should be understood as "either of the two elements whose square is b2 − 4ac, if such elements exist". In some fields, some elements have no square roots and some have two; only zero has just one square root, except in fields of characteristic 2. Note that even if a field does not contain a square root of some number, there is always a quadratic extension field which does, so the quadratic formula will always make sense as a formula in that extension field.
In a field of characteristic 2, the quadratic formula, which relies on 2 being a unit, does not hold. Consider the monic quadratic polynomial
over a field of characteristic 2. If b = 0, then the solution reduces to extracting a square root, so the solution is
and note that there is only one root since
In summary,
See quadratic residue for more information about extracting square roots in finite fields.
In the case that b ≠ 0, there are two distinct roots, but if the polynomial is irreducible, they cannot be expressed in terms of square roots of numbers in the coefficient field. Instead, define the 2-root R(c) of c to be a root of the polynomial x2 + x + c, an element of the splitting field of that polynomial. One verifies that R(c) + 1 is also a root. In terms of the 2-root operation, the two roots of the (non-monic) quadratic ax2 + bx + c are
and
For example, let a denote a multiplicative generator of the group of units of F4, the Galois field of order four (thus a and a + 1 are roots of x2 + x + 1 over F4). Because (a + 1)2 = a, a + 1 is the unique solution of the quadratic equation x2 + a = 0. On the other hand, the polynomial x2 + ax + 1 is irreducible over F4, but it splits over F16, where it has the two roots ab and ab + a, where b is a root of x2 + x + a in F16.
This is a special case of Artin-Schreier theory.